An Investigation of Indexing on the WWW

نویسندگان

  • Padmini Srinivasan
  • Miguel E. Ruiz
  • Wai Lam
چکیده

We propose a model that assists in understanding indexing on the World Wide Web (WWW). This model speciies key feature of indexing strategies that are currently being used. We also present an experiment assessing the validity of Inverse Document Frequency (IDF) as a term weighting strategy for WWW documents. The experiment indicates that IDF scores are not stable in the heterogeneous and dynamic context of the WWW. This research recommends further investigation to clarify the eeectiveness of alternative indexing strategies for the WWW.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Investigation on Crash Worthiness of Different Vehicle Brands: A Case Study of Rollover Crashes

This study aimed at indexing crash worthiness capability of 20 most frequently used car brands in Iran. Since rollover crashes are one of the most important crash types due to their high impact on crash severity, they were chosen as the case study of the current research. In this regard, the data of 42,118 rollover crashes of urban and rural roads of Iran which occurred from 2009 to 2012 was us...

متن کامل

Arch White Paper

Arch is an extension of the Apache Nutch, designed for efficient and effective indexing and search of organisational web sites (intranets). The corporate environment has a few very distinct characteristics as opposed to the global Web, for which Nutch was originally designed. Obviously, requirements to scalability are sufficiently lower when indexing an intranet than when indexing the WWW. On t...

متن کامل

A Comparing between the impacts of text based indexing and folksonomy on ranking of images search via Google search engine

Background and Aim: The purpose of this study was to compare the impact of text based indexing and folksonomy in image retrieval via Google search engine. Methods: This study used experimental method. The sample is 30 images extracted from the book “Gray anatomy”. The research was carried out in 4 stages; in the first stage, images were uploaded to an “Instagram” account so the images are tagge...

متن کامل

A Unified Approach to Indexing Multimedia on the Web

Indexing multimedia Web documents can be regarded as an important part of Web engineering, a concept first proposed [19] by one of the authors and his collaborators in 1998 at the World Wide Web WWW7 conference in Brisbane, Australia. Contentbased indexing of multimedia has always been a challenging task. The enormity and diversity of the multimedia content on the World Wide Web (WWW) adds anot...

متن کامل

Teaching on the WWW: Assignment Focus and Information Indexing

Seventeen students completed a course in which no face-to-face meetings and no paper exchanged hand. All information was shared either on the WWW or by email. In the first few weeks, extensive email dialogue occurred about the method of learning but after that the students focused exclusively on the content of the course. The first readings and exercises gave the students much freedom of choice...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007